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Some of the new features of the symbolic manipulation system FORM are discussed. Then some recent results 
running its multithreaded version TFORM are shown. Finally the plans for the future are presented. 



1. New Features 

Over the past few years the symbolic manip- 
ulation system FORM[T] has picked up a num- 
ber of new features. Some arc designed for bet- 
ter speed, some for more convenience. The most 
recent facility is the set of transform statements 
for manipulating functions with a large number 
of arguments. The problem here is that doing 
the arguments one by one often involves a large 
number of statements, and always a superfluous 
amount of pattern matching. Just imagine we 
have a function f with 20 arguments. All have a 
value of either zero or one. Now we want to re- 
place the zeroes by ones and the ones by zeroes. 
One way to do this is with 

Multiply fl; 

repeat id f (x? , ?a) *f 1 (?b) = 

f (?a)*fl(?b,l-x) ; 
id f*f l(?a) = f (?a) ; 

Note that the repeat loop has to be done 20 times. 
With the new transform statement we would have 

Transform , f , replace ( 1 , last )=(1, 0,0,1) ; 

On my laptop the first method takes 45.20/zsec. 
and the second method takes 1.38/zsec. In addi- 
tion the code is easier to understand. This gives 
a smaller chance of errors. 

One can have different subkeys and one state- 
ment can have a whole chain of operations as in 

CF H; 

L F = H(3, 4, 2, 6, 1,1, 1,2) ; 

Transform, H,tointegralnotation(l ,last) , 
replaced, last) = (0, 1,1,0) , 
encoded ,last) :base=2; 



Print; 
. end 

F = 

HO07202) ; 

One can also split the transform statement in 
various statements if one would like to see what 
happens: 

CF H; 

Off Statistics; 

L F = H(3, 4, 2, 6, 1,1, 1,2) ; 

Print "<1> 7.t"; 

Transf orm,H,t ointegralnot at ion d , last) ; 
Print "<2> %t" ; 

Transf orm,H, replace (1 , last)=(0 ,1,1,0); 
Print "<3> 7.t"; 

Transform , H , encode ( 1 , last) : base=2 ; 
Print "<4> 7.t"; 
. end 

<1> + H(3, 4, 2, 6, 1,1, 1,2) 
<2> + H(0, 0,1, 0,0, 0,1, 0,1, 0,0, 0,0, 0,1,1 

,1,1,0,1) 

<3> + H(l, 1,0, 1,1, 1,0, 1,0, 1,1, 1,1, 1,0,0 

,0,0,1,0) 

<4> + HO07202) 

The old style program would have been 

#: Workspace 10M 
Symbol x,xl,x2; 
CF H,H1; 

L F = H(3, 4, 2, 6, 1,1, 1,2); 
repeat id H(?a,x? ! {0 , 1} , ?b) = 

H(?a,0,x-l,?b) ; 

Multiply HI; 
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repeat id H(x? , ?a) *H1 (?b) = 

H(?a)*Hl(?b,l-x) ; 
id H*Hl(?a) = H(?a) ; 
repeat id H(xl?,x2?,?a) = 

H(2*xl+x2,?a) ; 

Print ; 
. end 

F = 

HO07202) ; 

The relevant code here takes 130/xsec. while the 
composite transform statement takes 1.82/isec. In 
addition the last code needs commentary if one 
would like to know what it actually does. 

The transform statement also allows any per- 
mutations of arguments. In the example here we 
have cyclic permutations, the first is one back- 
wards and the second is two forwards: 

CF fl,f2; 

S a,b , c , d, e ; 

L F = f l(a,b,c,d,e)* 

f2(3,2*a,4,c,l,2,3); 
Transform, f 1 , cycled ,last)=-l ; 
Transform , f 2 , cycle ( 1 , last ) =+2 ; 
Print; 
. end 

F = 

f l(b,c,d,e,a)*f2(2,3,3,2*a,4,c,l) ; 

There are also facilities for Lyndon [2] words of 
arguments as this is usually messy to program 
externally: 

Symbol x,xl,x2; 

CF H,Hl,f; 

Off Statistics; 

L F = H(3, 4, 2, 6, 1,1, 1,2) 

+H(6, 1,1, 1,2,3,4,2) 

+H(4, 3, 2, 1,4,3,2,1) 

+H(4, 3, 2, 1,4,2,2,2) 

+H(4,2,2,2,4,3,2,1) 

+H(1, 1,1,6,2,4,3,2) 

+H(2,4,3,2,1,1,1,6) ; 
Tr ansf orm , H , t oLyndon> ( 1 , last ) = 

(f(l),f(0)); 

Print +s; 



. end 
F = 

+ 2*H(4,3,2,l,4,2,2,2)*f (1) 
+ H(4,3,2, 1,4,3,2, l)*f (0) 
+ 2*H(6,l,l,l,2,3,4,2)*f (1) 
+ 2*H(6,2,4,3,2,l,l,l)*f (1) 

The term is multiplied by f ( 1 ) when it is a Lyn- 
don word and by f (0) when it is not. If we would 
have put just (1,0) at the end of the transform 
statement the non-Lyndon term would have been 
absent in the output. 

The same program, but now with the ordering 
'smallest first': 

Symbol x,xl,x2; 

CF H,Hl,f; 

Off Statistics; 

L F = H(3, 4, 2, 6, 1,1, 1,2) 

+H(6, 1,1, 1,2,3,4,2) 

+H(4, 3, 2, 1,4,3,2,1) 

+H(4, 3, 2, 1,4,2,2,2) 

+H(4,2,2,2,4,3,2,1) 

+H(1, 1,1,6,2,4,3,2) 

+H(2,4,3,2,1,1,1,6) ; 
Tr ansf orm , H , t oLyndon< ( 1 , last ) = 

(f (l),f (0)); 

Print +s; 
. end 

F = 

+ 2*H(l,l,l,2,3,4,2,6)*f (1) 
+ 2*H(l,l,l,6,2,4,3,2)*f (1) 
+ 2*H(l,4,2,2,2,4,3,2)*f (1) 
+ H(l,4,3,2,l,4,3,2)*f (0) 

Large runs suffer from the problem that the 
computer may not be up that long. Jens Vollinga 
has worked at a checkpoint facility which allows 
the user to make 'snapshots' before the start of a 
module. If FORM crashes, for instance due to a 
power outage, one can restart at the beginning of 
that module. 

Currently this is still being debugged and 
tuned. For some applications it is still too slow. 
There are also still some childhood diseases, but 
things improve. 
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TF0RM[3] has been improved a bit. The mas- 
ter needs far less time when there are brackets and 
the brackets have been indexed. In that case the 
master can tell the workers to deal with complete 
brackets. From that point on each worker is re- 
sponsible for finding the terms of the brackets on 
its own. In the old setup, the master has to read 
all terms and put them in the 'buckets', before 
giving the buckets to the workers. The speedup 
is noticeable. This still leaves the bottlenecks at 
the end of the sorting. There exist algorithms 
that might be able to deal with this but they are 
rather complicated. They are planned for a future 
upgrade. 

Last but not least: there have been numerous 
bug fixes. For this many thanks to the people who 
provide me with concise bug reports that allow 
me to catch these bugs. 

2. Something to boast about 

One of the great testjobs during the develop- 
ment over the past few years has been the expres- 
sion of Multiple Zeta Values [4|5|6] in terms of a 
minimal basis. This is mainly a matter of solving 
a system of linear equations in which the coeffi- 
cients in the homogeneous part are rational num- 
bers and the inhomogeneous part contains sums 
and products of basis elements of a lower weight 
with sometimes rather bad rational coefficients. 
In the worst case the number of equations may 
run in the millions and the number of unknows 
can be around 1 million or more. 

The worst run thus far took 69 days on the 
8 cores of one of the nodes of the computer in 
Karlsruhe and verified the conjecture that a new 
type of basis element was going to enter. 

One of the statistics in this program: 

Time = 69738.22 sec Generated terms= 

6768912520814 
FF Terms in output= 

2563910243 
substitution (8-sh) -4544 Bytes used 

61564939480 

The total number of generated terms in the job 
was 28,710,904,088,430 which is 600,000 terms 
per second per core. 



The number of variables increases with 2 W ~ 3 . 
Before this project was started, the mathemati- 
cians had gotten to w = 18 and for w = 19 and 
w = 20 they had used matrix techniques to de- 
termine only the size of the basis. Now we have a 
full basis up to w = 26 and w — 28. For w = 27 
we still miss two basis elements but we can guess 
them. 

At the same time we studied something discov- 
ered earlier by Broadhurst[7], called pushdowns in 
which basis elements of the MZV's could be ex- 
pressed in terms of alternating sums with fewer 
indices as in: 

64 7967 1 4 

^6,4,1,1 - -27^,5-^^9,3 + ^3 

11431 799 
+ T296" CrC5 " ~T2 C ^ + 3C2Z7 < 3 

+ ^C 2 C 5 2 + 10C2C7C3 + ~Cf ^5,3 

-lc*c c - -c 3 c 2 - 560785 V 

5 1,2 1,51,3 35^3 6081075^ 2 

with A 7 , 5 = Hj^ - H-7 y5 . 

In ref [6] we managed to locate 16 of such re- 
lations in a combined symbolic (FORM) and nu- 
merical (PSLQ) effort. 

In table 1 we give the numbers of basis elements 
of a certain type: the number of elements belong- 
ing to the set of Lyndon words with odd integers 
greater than 1 (and adding up to the weight), the 
number of such elements in which the first two in- 
dices have been lowered by one and two ones have 
been added, and finally the number of such ele- 
ments in which the first four elements have been 
lowered by one and 4 ones have been added. As 
in 

#5,3,5,3,5,3,3 
#7,5,3,3,3,3,3 — > #6,4,3,3,3,3,3,1,1 
#7,5,7,5,3 - ► #6,4,6,4,3,1,1,1,1 

In the left and top of table 1 we have verified that 
the number of elements in which the first two in- 
dices have been lowered by one and two ones have 
been added corresponds exactly to the number of 
pushdowns. For the underlined numbers we can 
determine a basis but testing explicit pushdowns 
is beyond our reach. The number in the box cor- 
responds to the new run in which we found the 
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Tabic 1: Number of MZV basis elements as a function of weight and depth 



new basis element for which the first four indices 
are lowered by one and four indices one have been 
added. The case with W — 27, D = 9 is currently 
running. 

The fact that this system of basis construction 
and the pushdowns give the same numbers is very 
suggestive. 



3. Future Features 

When we arc looking towards the future we 
should first consider who are doing the work. I 



have compiled a list of people who have and are 
working at FORM during various stages. It is 
shown in table 2. 

Then there are of course the beta testers who 
sometimes put in much work to produce a con- 
cise bug report or who come up with useful 
suggestions. A number of the most important 
ones are Ettore Remiddi, Kostia Chetyrkin, York 
Schroder, Thomas Hahn, Takahiro Ueda and Pe- 
ter Uwer. My apologies if I forget people here. 
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JV 1984-now 
Geert Jan van Oldenborgh manual(90's) 

Andre Heck manual(90's) 

Albert Retey 1997-2000 

Denny Fliegner 1998-2000 

Markus Frank 2000 

Andrei Onishchenko 2000-2002 

Misha Tentyukov 2002-now 

Jens Vollinga 2007-now 

Thomas Reiter 2008-now 

Irina Pushkina 2009-now 

Jan Kuipers 2009-now 

Table 2: People who have worked at FORM 



3.1. Open Source 

Sometimes one would like to have quick pri- 
vate additions for things that are extremely hard 
to program at the FORM level. Such things are 
often either of combinatoric nature or special pat- 
terns. It is of course impossible to forsee what 
some people will need. Hence FORM should be 
structured in such a way that it is possible to 
make such additions oneself, even though this 
won't be for beginners. The first requirement for 
this is a good documentation of the inner work- 
ings, including a number of examples. The second 
requirement is code that can be understood and 
is structured properly. Due to these two require- 
ments FORM hasn't been released yet as open 
source. We hope to be rectify this by this sum- 
mer. Jens Vollinga is working hard at it. 

3.2. Rational Polynomials 

Systems of equations that need to be solved are 
asking often for capabilities with rational polyno- 
mials. Most notoriously are the Laporta [8|9j al- 
gorithms. This is something that FORM doesn't 
have currently. Hence it has rather high priority 
to build this in. And to build this in in a rather 
efficient way as belonging to FORM. There exist 
libraries for the manipulation of polynomials in a 
single variable, some of them claiming great ef- 
ficiency, but there are no equivalent libraries for 
polynomials in many variables. In addition there 
is the problem of notation. Too much time spent 



on conversion will not be beneficial. Some par- 
tial code exists. Most univariate algorithms (in 
particular the GCD) have been implemented in 
various methods. This is by now reasonably fast. 
Factorization is completely missing. Jan Kuipers 
is working on this and also the multivariate cases. 

It is important to deal with multivariate ratio- 
nal polynomials efficiently when one likes to cre- 
ate a system for computing Grobner bases. There 
are however several ways to deal with polynomials 
and each way needs its own solution: 

• Small polynomials: when they take a small 
amount of space they can be kept inside the 
argument of a function. There may be bil- 
lions of such polynomials. They should be 
treated inside the regular workspace. Uni- 
variate polynomials will usually be in this 
category. 

• Intermediate polynomials: these could be 
handled by means of memory allocations as 
is done with the dollar variables. One could 
have hundreds or even thousands of them. 
Typically not billions. 

• Large polynomials: These are complete ex- 
pressions that could have billions of terms. 
Calculating their GCD would have to use 
the same mechanisms by which expressions 
are treated. There should be only very few 
of these. 

Symbols x,y; 
CFunction pace; 
PolyRatFun pace; 

L F=pacc (x~2+x-3 , (x+1) * (x+2) ) *y 
pace (x~2+3*x+l , (x+3) * (x+2) ) *y~2 

Print +s; 
. sort 



+y*pacc (x~2+x-3 , x~2+3*x+2) 
+y~2*pacc(x~2+3*x+l ,x~2+5*x+6) 



id y = 1; 
Print; 
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. end 
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pacc(2*x~2+4*x-4,x~2+4*x+3) ; 
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